Towards Efficient String Processing of Annotated Events
Abstract
This paper explores the use of strings as models to represent event data of the kind found in a document annotated with ISO-TimeML. We describe the translation of such data to strings, as well as a number of operations, such as superposition, which may be used to manipulate these strings in order to infer new information. Some advantages and limitations of the operations are discussed, including issues of over-generation, which can be mitigated through the use of suitable constraints. In particular, we look at how Allen relations, which might be extracted from a document annotated with ISO-TimeML, can be understood as useful constraints and translated to strings.
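The abstract does not define superposition, so the following is only a minimal sketch under stated assumptions: a string is modeled as a sequence of "boxes" (sets of event labels that hold at the same moment), and superposition of two equal-length strings is taken to be the componentwise union of their boxes. The names `make_string`, `superpose`, and `block_compress` are hypothetical and chosen for illustration; they are not taken from the paper.

```python
from typing import FrozenSet, Iterable, Tuple

# Assumption: a "string" is a tuple of boxes, each box a set of event labels.
Box = FrozenSet[str]
EventString = Tuple[Box, ...]


def make_string(*boxes: Iterable[str]) -> EventString:
    """Build an event string from iterables of labels, one per position."""
    return tuple(frozenset(box) for box in boxes)


def superpose(s: EventString, t: EventString) -> EventString:
    """Componentwise union of two strings of equal length (assumed definition)."""
    if len(s) != len(t):
        raise ValueError("superposition is only sketched here for equal-length strings")
    return tuple(a | b for a, b in zip(s, t))


def block_compress(s: EventString) -> EventString:
    """Collapse runs of identical adjacent boxes into a single box."""
    out = []
    for box in s:
        if not out or out[-1] != box:
            out.append(box)
    return tuple(out)


# Event "a" spans positions 1-2; event "b" occupies position 2 only.
x = make_string([], ["a"], ["a"], [])
y = make_string([], [], ["b"], [])
combined = superpose(x, y)
```

Here `combined` has four boxes: an empty box, a box holding only `a`, a box holding both `a` and `b`, and a final empty box. Restricting the operation to equal-length strings keeps the sketch simple; combining strings of different lengths would need some padding scheme, which is one place where over-generation of the kind mentioned in the abstract could arise.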
Related resources
Extracting 'Significant' Patterns from Musical Strings: Some Interesting Problems
In this paper a number of issues relating to the application of string processing techniques on musical sequences are discussed. Special attention is given to musical pattern extraction. Firstly, a number of general problems are presented in terms of musical representation and pattern processing methodologies. Then a number of interesting melodic pattern matching problems are presented. Finally...
Some Thoughts on Using Annotated Suffix Trees for Natural Language Processing
The paper defines an annotated suffix tree (AST), a data structure used to calculate and store the frequencies of all the fragments of a given string or collection of strings. The AST is associated with a string-to-text scoring, which takes all fuzzy matches into account. We show how the AST and the AST scoring can be used for Natural Language Processing tasks.
Fuzzy Neighbor Voting for Automatic Image Annotation
With the rapid development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of the most challenging fields in image processing. Automatic image annotation (AIA) refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...
Towards Unsupervised Learning of Temporal Relations between Events
Automatic extraction of temporal relations between event pairs is an important task for several natural language processing applications such as Question Answering, Information Extraction, and Summarization. Since most existing methods are supervised and require large corpora, which for many languages do not exist, we have concentrated our efforts to reduce the need for annotated data as much a...
JAAE: the Java abstract annotation editor
Recent trends in NLP (Natural Language Processing) are heading towards stochastic processing of natural language. Stochastic methods, however, usually demand a lot of annotated training data. In most cases, the annotation of the data has to be done manually by a team of annotators, and it is a highly time-consuming and expensive process. Thus we tried to develop an efficient and user-friendly e...